Learning-Based Three Dimensional Sound Localization Using a Compact Non-Coplanar Array of Microphones

نویسندگان

  • Kamen Y. Guentchev
  • John J. Weng
چکیده

One of the various human sensory capabilities is to identify the direction of perceived sounds. The goal of this work is to study sound source localization in three dimensions using some of the most important cues the human uses. Having robotics as a major application, the approach involves a compact sensor structure that can be placed on a mobile platform. The objective is to estimate the relative sound source position in three dimensional space without imposing excessive restrictions on its spatio-temporal characteristics and the environment structure. Two types of features are considered, interaural time and level differences. Their relative effectiveness for localization is studied, as well as a practical way of using these complementary parameters. A two-stage procedure was used. In the training stage, sound samples are produced from points with known coordinates and then are stored. In the recognition stage, unknown sounds are processed by the trained system to estimate the 3D location of the sound source. Results from the experiments showed under ±3° in average angular error and less than ±20% in average radial distance error.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

N-dimensional N-microphone sound source localization

This paper investigates real-time N-dimensional wideband sound source localization in outdoor (far-field) and lowdegree reverberation cases, using a simple N-microphone arrangement. Outdoor sound source localization in different climates needs highly sensitive and high-performance microphones, which are very expensive. Reduction of the microphone count is our goal. Time delay estimation (TDE)-b...

متن کامل

Proceedings of Meetings on Acoustics

We proposed a sensing method of three-dimensional (3D) sound-space information based on symmetrically and densely arranged microphones mounted on a solid sphere. We call this method SENZI (Sakamoto et al., 2008). In SENZI, the sensed signals from each of the microphone is simply weighted and summed to synthesize a listener's HRTF, reflecting the listener's facing direction. Weighting coefficien...

متن کامل

Three-dimensional Sound Field Reproduction and Recording Systems Based on Boundary Surface Control Principle

Based on the boundary surface control (BSC) principle, a new recording/reproduction system is developed to realize high fidelity three-dimensional sound field reproduction. Theoretically, using this new system, perfect sound field reproduction can be achieved in any acoustic environment. Sound recording / reproduction systems based on the BSC principle require many loudspeakers and microphones....

متن کامل

Iterative Spatial Probability Based Sound Localization

A two-dimensional sound localization system using an iterative spatial probability (ISP) algorithm is proposed. With the ISP algorithm, sound signals from two microphones are iteratively crosscorrelated and the time index of the strongest correlation is recorded in a histogram. This information is then used to create a spatial probability map, which makes it possible to accommodate any number o...

متن کامل

Active stereo sound localization.

Estimating the direction of arrival of sound in three-dimensional space is typically performed by generalized time-delay processing on a set of signals from a fixed array of omnidirectional microphones. This requires specialized multichannel A/D hardware, and careful arrangement of the microphones into an array. This work is motivated by the desire to instead only use standard two-channel audio...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1998